Statistical Significance in Inductive Learning

نویسندگان

  • Olivier Gascuel
  • Gilles Caraux
چکیده

Inductive learning systems search for regularities that therefore be applied with some assurance to an example describe environmental observations, These systems often use which does not belong to the learning set. In other numeri~~l heu~stics to guide this search, The~ also sele~t words, statistical significance may be used to assess regulantles which are good, or the best, according to certain that a learnin g Program reall y learned th ' d d ' d , 1 ' , S " , al h 1 " d some mg an 1 numenca cntena, tatlstlc measures ave recent y game t I ' d t f t 1 " , th AI ' Th ' " d " 1 no on y provl e ar e ac s, popu anty m e communIty, IS paper provi es a simp e method for fully exploiting statistical measures, We give a test However, there is a difficulty. Let us continue our which may be used to decide whether a given regularity is previous example. A classification rule provided by a statistically significant, or, in other words, whether this regularity system is found by optimizing a given measure in the may be distinguished from random" This method directly uses learning set. Therefore, the score obtained in the results obta~ed ~rom the learning set with~ut ~qui,ring any test learning set by this rule with this measure, inevitably set. An application to Concep~al Oustenng IS given and ,the provides an optimistically biased view of the real performance of the method IS evaluated by a numencal " " , ul ' P" all ' d f 1 d d ' d performance of the rule, The same holds when CrIterIa slm atlon, In y, we provi e Ie erences to Ie ate stu les an, , d b tabl t " wlth related semantIcs are used, e. g. entropic and some e a e ques Ions. likelihood ratio criteria in CN2. Moreover, because of the (large) number of possible rules, there are always

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A New Intrusion Detection Method Based on Data-Oriented Classification of Attacks

The most acute problem for misuse detection method is its inability to detect new kinds of attacks. A new detection method based on data-oriented classification of attacks is proposed to solve this problem. After analyzing its significance, a practical scheme which uses relevant feature subset codes clustering is designed. Applying Concept Hierarchy Generation for attack Labels (CHGL), inductiv...

متن کامل

A Survey on Inductive Learning

Inductive learning is the method of learning from observations. Inductive learning has important applications over a wide range of area including pattern recognition, language acquisition, bio-informatics and intelligent agent design. Because of such diverse applicability, inductive learning methods including automata learning, grammar induction, hidden markov model learning and symbolic statis...

متن کامل

Probabilistic Inductive Logic Programming

Probabilistic inductive logic programming, sometimes also called statistical relational learning, addresses one of the central questions of artificial intelligence: the integration of probabilistic reasoning with first order logic representations and machine learning. A rich variety of different formalisms and learning techniques have been developed. In the present paper, we start from inductiv...

متن کامل

The Variable Precision Rough Set Inductive Logic Programming Model — a Statistical Relational Learning Perspective

The Variable Precision Rough Set Inductive Logic Programming model (VPRSILP model) extends the Variable Precision Rough Set (VPRS) model to Inductive Logic Programming (ILP). The VPRSILP model is considered from the Statistical Relational Learning perspective, by comparing and contrasting it with Stochastic Logic Programs.

متن کامل

An Inductive Logic Programming Approach to Statistical Relational Learning

. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1

متن کامل

Logic, Probability and Learning, or an Introduction to Statistical Relational Learning

Probabilistic inductive logic programming (PILP), sometimes also called statistical relational learning, addresses one of the central questions of artificial intelligence: the integration of probabilistic reasoning with first order logic representations and machine learning. A rich variety of different formalisms and learning techniques have been developed and they are being applied on applicat...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1992